Blog posts tagged with open source

Improving Legal Text Analysis with Precise Sentence Boundary Detection

ALEA on Tue Apr 08 2025

Introducing NUPunkt and CharBoundary: two specialized libraries that dramatically improve sentence boundary detection in legal documents.

KL3M Data Project: Copyright-Clean AI Training Resources

ALEA on Tue Apr 15 2025

Introducing the KL3M Data Project: a comprehensive collection of legally sound training resources for large language models spanning 132+ million documents.